TITLE OF THE INVENTION 
OBJECT MONITORING APPARATUS 
BACKGROUND OF THE INVENTION 
Field of the Invention 
5 This invention relates to an object monitoring apparatus 

including a camera. 
Description of the Related Art 

Japanese patent application publication number 11-044837 
discloses an automatic focusing device for a camera. The device in 
1 0 Japanese application 1 1-044837 includes a focal point detector. An 
image of an object is repetitively photoelectrically converted into an 
object image signal composed of pixel corresponding signals. The 
focal point detector outputs the object image signal to a motion 
prediction calculator. The motion prediction calculator stores the 

1 5 object image signal into an internal memory. 

In the device of Japanese application 11-044837, a motion 
deciding section divides the newest object image signal outputted 
from the focal point detector into blocks. The motion deciding 
section accesses the previous object image signal which is stored in 

2 0 the memory within the motion prediction calculator. The motion 

deciding section divides the previous object image signal into 
blocks. The motion deciding section evaluates the correlation 
between the newest object image signal and the previous object 
image signal on a block-by-block matching basis. The motion 
2 5 deciding section informs the motion prediction calculator of the 
evaluated correlation. The motion prediction calculator computes 



-2- 



the length, traveled by the object in a direction along the optical 
axis of a camera lens, on the basis of the evaluated correlation. A 
defocus calculator predicts the distance between the object and a 
camera, which will occur a predetermined time after the present 
5 moment, in response to the computed traveled length. A sequence 
controller drives the camera lens in response to the predicted 
distance to implement automatic focusing control. 

U.S. Patent No. 5,777,690 discloses a device for detecting 
moving objects. The device in U.S. Patent No. 5,777,690 includes 

10 an optical flow extracting unit for extracting optical flows for the 
respective local regions in the measured images, an focus of 
expansion (FOE) calculating unit for calculating an FOE of a straight 
line extended by the extracted optical flows, and a moving obstacle 
detecting unit for analyzing a temporal change of the calculated FOE 

15 to judge the presence of the moving obstacle when the temporal 

positional change is larger than a predetermined variation quantity. 

Japanese patent application publication number 5-332725 
discloses an apparatus including a lens, a diaphragm, and an imager. 
The diaphragm immediately follows the lens. The imager follows 

2 0 the diaphragm. The imager is movable relative to the lens. When 

the imager is in a position at which an image of an object is in focus, 
the size and shape of the image remains unchanged independent of 
a change of the diaphragm. On the other hand, when the imager is 
out of such an in-focus position, the size and shape of the image 

2 5 varies in accordance with a change of the diaphragm. In the 
apparatus of Japanese application 5-332725, an image signal 



outputted from the imager is processed while the imager is moved 
and the diaphragm is changed. Specifically, components of the 
image signal which represent edges in the image are monitored. 
The edge-representing signal components are used in deciding 
whether or not the size and shape of the image varies in accordance 
with the change of the diaphragm. The decision result provides 
detection of an in-focus position for the imager. 

A. Pentland et al reported a simple real-time range camera, 
1989 IEEE, pages 256-260. The Pentland's camera includes a 
simple imaging range sensor based on the measurement of focal 
error. Specifically, the error in focus is measured by comparing two 
geometrically identical images, one with a wide aperture, so that 
objects off the focal plane are blurred, with a small- aperture image 
where everything is sharply focused. The images are collected at 
the same time, so that scene motion is not a problem, and are 
collected along the same optical axis with the same focal length, so 
that there is no geometrical distortion. 

There is a conceivable visual monitor apparatus which is not 
prior art against this invention. The conceivable visual monitor 
apparatus is provided with a camera which includes a photoelectric 
conversion device, a lens located in front of the photoelectric 
conversion device, and an actuator for moving the lens relative to 
the photoelectric conversion device. Light passes through the lens 
before reaching the photoelectric conversion device and forming 
thereon an image of a scene extending in front of the camera. The 
photoelectric conversion device converts the image into a 



corresponding video signal. The photoelectric conversion device 
outputs the video signal. In the conceivable visual monitor 
apparatus, the actuator is controlled to periodically and cyclically 
change the distance between the lens and the photoelectric 
conversion device among three different values. According to the 
distance change, the plane on which the camera is focused is 
changed among three separate positions (first, second, and third in- 
focus positions). 

In the conceivable visual monitor apparatus, a first memory is 
loaded with the video signal representing an image which occurs 
when the first in-focus position is taken. A second memory is 
loaded with the video signal representing an image which occurs 
when the second in-focus position is taken. A third memory is 
loaded with the video signal representing an image which occurs 
when the third in-focus position is taken. The video signals in the 
first, second, and third memories are analyzed to find a common 
object contained in the images. In addition, the degree of focus for 
the object is calculated regarding each of the images. The 
calculated degrees of focus for the object are compared to decide 
which of the images corresponds to the best focus. The position or 
positional range of the object along an optical axis of the camera is 
determined on the basis of the in-focus position corresponding to 
the best-focus image. 

In the conceivable visual monitor apparatus, the actuator 
remains periodically and cyclically driven regardless of whether an 
object of interest is moving or stationary. Accordingly, the 
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conceivable visual monitor apparatus tends to consume power at a 
high rate. 

SUMMARY OF THE INVENTION 
It is an object of this invention to provide an improved object 
5 monitoring apparatus. 

A first aspect of this invention provides an object monitoring 
apparatus comprising a movable lens; first means for converting an 
image, represented by light passing through the lens, into a video 
signal; second means for detecting a moving object in an image 
1 0 represented by the video signal generated by the first means; third 
means for, when the second means detects a moving object, moving 
the lens to change an in-focus position, on which a combination of 
the lens and the first means is focused, among predetermined 
positions different from each other; fourth means for detecting 

1 5 degrees of focus of images represented by video signals which are 

generated by the first means when the in-focus position coincides 
with the predetermined positions respectively; fifth means for 
deciding a greatest of the focus degrees detected by the fourth 
means; and sixth means for indicating the video signal representing 

2 0 the image having the greatest focus degree decided by the fifth 

means. 

A second aspect of this invention provides an object 
monitoring apparatus comprising a movable lens; first means for 
converting an image, represented by light passing through the lens, 
2 5 into a video signal; second means for moving the lens to change an 
in-focus position, on which a combination of the lens and the first 
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means is focused, among predetermined positions different from 
each other; third means for analyzing frequencies of video signals 
which are generated by the first means when the in-focus position 
coincides with the predetermined positions respectively; fourth 
5 means for deciding a highest of the frequencies analyzed by the 
third means; and fifth means for indicating the video signal having 
the highest frequency decided by the fourth means. 

A third aspect of this invention provides an object monitoring 
apparatus comprising a movable lens; first means for converting an 
1 0 image, represented by light passing through the lens, into a video 
signal; second means for moving the lens to change an in-focus 
position, on which a combination of the lens and the first means is 
focused, among predetermined positions different from each other; 
third means for analyzing frequencies of video signals for each of 

1 5 different bands, said video signals being generated by the first 

means when the in-focus position coincides with the 
predetermined positions respectively; fourth means for detecting a 
frequency component difference among the video signals from 
results of said analyzing by the third means for each of the different 

2 0 bands; fifth means for deciding a greatest of the frequency 

component differences detected by the fourth means and 
corresponding to the respective different bands; sixth means for 
detecting frequency components in the respective video signals for 
the band corresponding to the greatest frequency component 
2 5 difference decided by the fifth means from the results of said 

analyzing by the third means; seventh means for deciding a highest 
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of the frequency components detected by the sixth means; and 
eighth means for indicating the video signal having the highest 
frequency component decided by the seventh means. 

A fourth aspect of this invention is based on the first aspect 
5 thereof, and provides an object monitoring apparatus wherein the 
first means comprises light receiving units arranged in a lattice, 
expansion-contraction members connecting the light receiving 
units, a CCD-based photoelectric conversion device for converting 
light received by the light receiving units into an electric signal, and 
1 0 means for expanding and contracting the expansion-contraction 

members to change an effective light receiving area covered by the 
light receiving units. 

A fifth aspect of this invention provides an object monitoring 
apparatus comprising a combination lens including segments having 

1 5 different focal points respectively; condensers for condensing light 

beams passing through the segments of the combination lens, 
respectively; first means for converting the light beams condensed 
by the condensers into video signals, respectively; second means for 
detecting frequency components in the video signals generated by 

2 0 the first means, respectively; third means for deciding a highest of 

the frequency components detected by the second means; and 
fourth means for indicating the video signal having the highest 
frequency component decided by the third means. 

A sixth aspect of this invention is based on the fifth aspect 
2 5 thereof, and provides an object monitoring apparatus further 
comprising an optical fiber cable for guiding the light beams 
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condensed by the condensers to the first means. 

A seventh aspect of this invention provides an object 
monitoring system comprising a set of object monitoring 
apparatuses arranged to monitor surroundings of a rectangle, 
5 wherein each of the object monitoring apparatuses includes the 
object monitoring apparatus of the fifth aspect of this invention. 

An eighth aspect of this invention provides an object 
monitoring apparatus comprising a camera generating a video 
signal; first means for deciding whether a moving object is present 

10 in or absent from an image represented by the video signal 

generated by the camera; second means responsive to a result of the 
deciding by the first means for, in cases where the first means 
decides that a moving object is present in an image represented by 
the video signal, changing an in-focus position, on which the camera 

15 is focused, among predetermined positions including at least first 
and second predetermined positions; third means for detecting a 
first degree of focus of an image represented by a first video signal 
which is generated by the camera when the in-focus position 
coincides with the first predetermined position; fourth means for 

2 0 detecting a second degree of focus of an image represented by a 

second video signal which is generated by the camera when the in- 
focus position coincides with the second predetermined position; 
fifth means for deciding a greatest of the first and second focus 
degrees detected by the third and fourth means; sixth means for 

2 5 selecting one from among the first and second video signals which 
represents the image having the greatest focus degree decided by 
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the fifth means; and seventh means for displaying the video signal 
selected by the sixth means. 

A ninth aspect of this invention is based on the eighth aspect 
thereof, and provides an object monitoring apparatus wherein the 
5 third means comprises means for subjecting the first video signal 
DCT to generate first DCT coefficients, means for summating 
squares of DCT coefficients selected from among the first DCT 
coefficients to generate a first summation result, and means for 
detecting the first focus degree in response to the first summation 
1 0 result; and wherein the fourth means comprises means for 

subjecting the second video signal DCT to generate second DCT 
coefficients, means for summating squares of DCT coefficients 
selected from among the second DCT coefficients to generate a 
second summation result, and means for detecting the second focus 

1 5 degree in response to the second summation result. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 is a block diagram of a conceivable visual monitor 
apparatus. 

Fig. 2 is a block diagram of an object monitoring apparatus 

2 0 according to a first embodiment of this invention. 

Fig. 3 is a diagrammatic perspective view of an object and a 
camera in Fig. 2. 

Figs. 4, 5, and 6 are diagrams of images of the object in Fig. 3 
which are generated when the camera is focused on different 
2 5 positions, respectively. 

Fig. 7 is a diagrammatic perspective view of trespassers and 
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the camera in Fig. 2. 

Fig. 8 is a block diagram of an object monitoring apparatus 
according to a second embodiment of this invention. 

Fig. 9 is a diagram of a frame, a block, and a DCT- coefficient 
5 matrix. 

Fig. 10 is a block diagram of an object monitoring apparatus 
according to a third embodiment of this invention. 

Fig. 1 1 is a diagram of a DCT-coefficient matrix and a first 
example of a band region (a window region) set therein. 
1 0 Fig. 12 is a diagram of a DCT-coefficient matrix and a second 

example of the band region (the window region) set therein. 

Fig. 13 is a block diagram of an object monitoring apparatus 
according to a fourth embodiment of this invention. 

Fig. 14 is a diagram of an object, a lens, and images of the 

1 5 object which are formed on different projection planes respectively. 

Fig. 15 is a diagrammatic section view of a condenser in Fig. 

13. 

Fig. 16 is a diagram of an expansion- contraction member in 
Fig. 13. 

2 0 Fig. 17 is a block diagram of a portion of an object monitoring 

apparatus according to a fifth embodiment of this invention. 

Fig. 18 is a diagrammatic plan view of original lenses and a 
combination lens in Fig. 17. 

Fig. 19 is a block diagram of a portion of an object monitoring 
2 5 apparatus according to a sixth embodiment of this invention. 

Fig. 20 is a block diagram of an object monitoring apparatus 
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according to a seventh embodiment of this invention. 

Fig. 21 is a diagram of a vehicle provided with the object 
monitoring apparatus in Fig. 20. 

Fig. 22 is a block diagram of an object monitoring apparatus 
5 according to an eighth embodiment of this invention. 

Fig. 23 is a flowchart of a segment of a program for a signal 
processor in Fig. 22. 

Fig. 24 is a block diagram of an object monitoring apparatus 
according to a ninth embodiment of this invention. 
1 0 Fig. 25 is a flowchart of a segment of a program for a signal 

processor in Fig. 24. 

Fig. 26 is a flowchart of a block in Fig. 25. 

DETAILED DESCRIPTION OF THE INV ENTION 
A conceivable visual monitor apparatus will be explained below 

1 5 for a better understanding of this invention. 

With reference to Fig. 1, a conceivable visual monitor 
apparatus (not prior-art against this invention) includes a camera 
173. The camera 173 has a movable lens 171, an electrically- 
powered actuator 172, and a photoelectric conversion device 174. 

2 0 The lens 171 is located in front of the photoelectric conversion 

device 174. The actuator 172 operates to move the lens 171 
relative to the photoelectric conversion device 174. Light passes 
through the lens 171 before reaching the photoelectric conversion 
device 174 and forming thereon an image of a scene extending in 
2 5 front of the camera 173. The photoelectric conversion device 174 
converts the image into a corresponding video signal. The 



-1 2- 



photoelectric conversion device 174 outputs the video signal. The 
photoelectric conversion device 174 is of a CCD-based type. 

The conceivable apparatus of Fig. 1 further includes a signal 
distributor 175, a controller 176, a first memory 177, a second 
memory 178, a third memory 179, a signal processor 180, and a 
display 181. The signal distributor 175 is connected to the 
photoelectric conversion device 174 within the camera 173. In 
addition, the signal distributor 175 is connected to the controller 
176 and the memories 177, 178, and 179. The controller 176 is 
connected to the actuator 172 within the camera 173. The signal 
processor 180 is connected to the memories 177, 178, and 179, 
and the display 181. 

In the conceivable apparatus of Fig. 1, the controller 176 
includes a signal generator which produces a periodical control 
signal. The controller 176 outputs the produced control signal to 
the actuator 172 within the camera 173. Also, the controller 176 
outputs the control signal to the signal distributor 175. In the 
camera 173, the actuator 172 moves the lens 171 in response to 
the control signal fed from the controller 176. Specifically, the 
actuator 172 periodically and cyclically changes the distance 
between the lens 171 and the photoelectric conversion device 174 
among three different values. According to the distance change, the 
plane on which the camera 173 is focused is changed among three 
separate positions (first, second, and third in-focus positions). The 
first, second, and third in-focus positions are equal to the farthest, 
intermediate, and nearest positions as seen from the camera 173, 
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respectively. At least one complete image (a frame) is converted by 
the photoelectric conversion device 174 each time one of the first, 
second, and third in-focus positions is taken. 

In the conceivable apparatus of Fig. 1, the signal distributor 
175 receives the video signal from the photoelectric conversion 
device 174 within the camera 173. The signal distributor 175 
recognizes which of the first, second, and third in-focus positions is 
currently taken by referring to the control signal fed from the 
controller 176. Thus, the signal distributor 175 recognizes which 
of the first, second, and third in-focus positions an image currently 
represented by the video signal corresponds to. The signal 
distributor 175 includes a memory control device which acts on the 
memories 177, 178, and 179 in response to the control signal fed 
from the controller 176. When an image currently represented by 
the video signal corresponds to the first in-focus position, the signal 
distributor 175 stores the video signal into the first memory 177. 
When an image currently represented by the video signal 
corresponds to the second in-focus position, the signal distributor 
175 stores the video signal into the second memory 178. When an 
image currently represented by the video signal corresponds to the 
third in-focus position, the signal distributor 175 stores the video 
signal into the third memory 179. 

In the conceivable apparatus of Fig. 1 , the signal processor 
180 operates in accordance with a program stored in its internal 
ROM. The program is designed to enable the signal processor 180 
to implement processes mentioned later. The signal processor 180 
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accesses the video signals in the memories 177, 178, and 179. The 
signal processor 180 analyzes the video signals to find a common 
object contained in images represented by the video signals. The 
signal processor 180 calculates the degree of focus for the object 
regarding each of the images on a pixel-by-pixel basis. The signal 
processor 180 compares the calculated degrees of focus for the 
object, and decides which of the images corresponds to the best 
focus in response to the comparison results. The signal processor 
180 determines the position or positional range of the object along 
an optical axis of the camera 173 on the basis of the in-focus 
position corresponding to the best-focus image. 

In the conceivable apparatus of Fig. 1, the actuator 172 
remains periodically and cyclically driven regardless of whether an 
object of interest is moving or stationary. Accordingly, the 
conceivable apparatus tends to consume power at a high rate. 
First Embodiment 
Fig. 2 shows an object monitoring apparatus according to a 
first embodiment of this invention. The apparatus of Fig. 2 includes 
a movable lens 1, an electrically-powered actuator 2, and a 
photoelectric conversion device 3 provided in a camera or an image 
capturing device 4. The lens 1 is located in front of the 
photoelectric conversion device 3. The actuator 2 operates to move 
the lens 1 relative to the photoelectric conversion device 3. Light 
passes through the lens 1 before reaching the photoelectric 
conversion device 3 and forming thereon an image of a scene 
extending in front of the camera 4. The photoelectric conversion 
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device 3 converts the image into a corresponding video signal. The 
photoelectric conversion device 3 outputs the video signal. The 
photoelectric conversion device 3 implements periodical scanning 
so that the video signal represents a sequence of frames. The 
photoelectric conversion device 3 is of, for example, a CCD-based 
type. 

The apparatus of Fig. 2 further includes a signal distributor 5, 
a motion detector 6, a controller 7, a first memory 8, a second 
memory 9, a third memory 10, a signal processor 11, and a display 
12. The signal distributor 5 is connected to the photoelectric 
conversion device 3 within the camera 4. In addition, the signal 
distributor 5 is connected to the motion detector 6, the controller 
7, and the memories 8, 9, and 10. The motion detector 6 is 
connected to the controller 7. The controller 7 is connected to the 
actuator 2 within the camera 4. In addition, the controller 7 is 
connected to the signal processor 11. The signal processor 11 is 
connected to the memories 8, 9, and 10, and the display 12. 

The controller 7 includes a signal generator started by a 
trigger signal fed from the motion detector 6. The signal generator 
is deactivated by a turn-off signal fed from the motion detector 6. 
When being started, the signal generator produces a periodical 
active control signal. The controller 7 outputs the produced active 
control signal to the actuator 2 within the camera 4. Also, the 
controller 7 outputs the active control signal to the signal 
distributor 5 and the signal processor 11. When being deactivated 
by a turn-off signal fed from the motion detector 6, the signal 
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generator in the controller 7 does not produce the active control 
signal. 

In the presence of the active control signal outputted from the 
controller 7, the camera 4 operates as follows. The actuator 2 
moves the lens 1 in response to the active control signal. 
Specifically, the actuator 2 periodically and cyclically changes the 
distance between the lens 1 and the photoelectric conversion 
device 3 among three different values. According to the distance 
change, the plane on which the camera 4 is focused is changed 
among three separate positions (first, second, and third in-focus 
positions) PI, P2, and P3. It is shown in Fig. 3 that the first, 
second, and third in-focus positions PI, P2, and P3 are equal to the 
farthest, intermediate, and nearest positions as seen from the 
camera 4, respectively. At least one complete image (a frame) is 
converted by the photoelectric conversion device 3 each time one of 
the first, second, and third in-focus positions PI, P2, and P3 is 
taken. 

One of the three different values of the distance between the 
lens 1 and the photoelectric conversion device 3 is specified as an 
initial value or a normal value. In the absence of the active control 
signal outputted from the controller 7, the distance between the 
lens 1 and the photoelectric conversion device 3 remains equal to 
the initial value (the normal value). Accordingly, in the absence of 
the active control signal, one of the first, second, and third in-focus 
positions PI, P2, and P3 which corresponds to the initial distance 
between the lens 1 and the photoelectric conversion device 3 
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continues to be taken. This one of the first, second, and third in- 
focus positions PI, P2, and P3 is also referred to as the initial in- 
focus position or the normal in-focus position. Preferably, the 
second in-focus position P2 is used as the initial in-focus position. 
5 To enable the initial in-focus position to be taken in the absence of 
the active control signal, the actuator 2 is provided with a returning 
mechanism or a self-positioning mechanism. In the case where the 
active control signal outputted from the controller 7 remains 
absent, that is, in the case where the initial in-focus position 
1 0 continues to be taken, the photoelectric conversion device 3 

periodically converts an image formed thereon into a video signal. 

The signal distributor 5 receives the video signal from the 
photoelectric conversion device 3 within the camera 4. The signal 
distributor 5 passes the video signal to the motion detector 6. The 

1 5 motion detector 6 operates to detect an object motion in a stream 

of images represented by the video signal. When the device 6 
detects an object motion, the device 6 outputs a signal 
representative of the detected object motion to the controller 7 as a 
trigger signal. In the absence of a detected object motion, the 

2 0 motion detector 6 does not output any trigger signal to the 

controller 7. When a detected object motion disappears, the motion 
detector 6 outputs a turn-off signal to the controller 7. 

For example, the signal distributor 5 includes a programmable 
signal processor. In this case, the signal distributor 5 operates in 
2 5 accordance with a program stored in its internal ROM. The 

program is designed to enable the signal distributor 5 to implement 



processes mentioned later. The signal distributor 5 recognizes 
which of the first, second, and third in-focus positions PI, P2, and 
P3 is currently taken by referring to the active control signal fed 
from the controller 7. Thus, the signal distributor 5 recognizes 
which of the first, second, and third in-focus positions PI, P2, and 
P3 an image currently represented by the video signal corresponds 
to. The signal distributor 5 includes a memory control device which 
acts on the memories 8, 9, and 10 in response to the active control 
signal fed from the controller 7. In the case where the first, 
second, and third in-focus positions PI, P2, and P3 are periodically 
and cyclically taken by turns, the signal distributor 5 operates as 
follows. When an image currently represented by the video signal 
corresponds to the first in-focus position PI, the signal distributor 5 
stores the video signal into the first memory 8. When an image 
currently represented by the video signal corresponds to the second 
in-focus position P2, the signal distributor 5 stores the video signal 
into the second memory 9. When an image currently represented 
by the video signal corresponds to the third in-focus position P3, 
the signal distributor 5 stores the video signal into the third 
memory 10. On the other hand, in the case where the initial in- 
focus position (the second in-focus position P2) continues to be 
taken, the signal distributor 5 does not store the video signal into 
any of the memories 8, 9, and 10. 

Since the first, second, and third in-focus positions PI, P2, 
and P3 are spaced from the camera 4 by different distances 
respectively, a real same-size area forms different- size regions of 
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images represented by video signals corresponding to the first, 
second, and third in-focus positions PI, P2, and P3. The signal 
distributor 5 compensates for such image-size variation. 
Specifically, when an image currently represented by the video 
5 signal corresponds to the first in-focus position PI, the signal 

distributor 5 subjects the video signal to image-size correction to 
provide an equality with an image size corresponding to the second 
in-focus position P2. Then, the signal distributor 5 stores the 
correction-resultant video signal into the first memory 8. When an 
1 0 image currently represented by the video signal corresponds to the 
third in-focus position P3, the signal distributor 5 subjects the video 
signal to image- size correction to provide an equality with an image 
size corresponding to the second in-focus position P2. Then, the 
signal distributor 5 stores the correction-resultant video signal into 

1 5 the third memory 10. 

For example, the signal processor 1 1 is of a programmable 
type. In this case, the signal processor 1 1 operates in accordance 
with a program stored in its internal ROM. The program is 
designed to enable the signal processor 1 1 to implement processes 

2 0 mentioned later. The signal processor 1 1 decides whether or not 

the active control signal is being outputted from the controller 7. In 
other words, the signal processor 11 decides whether or not the 
first, second, and third in-focus positions PI, P2, and P3 are 
periodically and cyclically taken by turns, and hence decides 
2 5 whether or not the initial in-focus position (the second in-focus 
position P2) continues to be taken. In the case where the first, 



second, and third in-focus positions PI, P2, and P3 are periodically 
and cyclically taken by turns, the signal processor 1 1 accesses the 
video signals in the memories 8, 9, and 10. The signal processor 11 
analyzes the video signals to find a common object contained in 
images represented by the video signals. The signal processor 1 1 
calculates the degree of focus for the object regarding each of the 
images on a pixel-by-pixel basis. The signal processor 1 1 compares 
the calculated degrees of focus for the object, and decides which of 
the images corresponds to the best focus in response to the 
comparison results. The signal processor 1 1 transfers the video 
signal representative of the best-focus image from the related 
memory (the memory 8, 9, or 10) to the display 12. The signal 
processor 11 controls the display 12 to indicate the best-focus 
image represented by the transferred video signal. On the other 
hand, in the case where the initial in-focus position (the second in- 
focus position P2) continues to be taken, the signal processor 1 1 
does not access any of the memories 8, 9, and 10. 

With reference to Fig. 3, a moving object 22 has just reached 
the second in-focus position P2. In this case, as shown in Fig. 5, a 
frame represented by the video signal corresponding to the second 
in-focus position P2 contains an image of the object 22 which is in 
focus. On the other hand, as shown in Fig. 4, a frame represented 
by the video signal corresponding to the first in-focus position P2 
contains a fuzzy image of the object 22. Similarly, as shown in Fig. 
6, a frame represented by the video signal corresponding to the 
third in-focus position P3 contains a fuzzy image of the object 22. 
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With reference to Fig. 7, trespassers 31 and 32 come into the 
field 33 of view of the camera 4. When the motion detector 6 
detects motion of at least one of the trespassers 31 and 32, the 
device 6 outputs a trigger signal to the controller 7. The controller 
7 generates an active control signal in response to the trigger 
signal, and outputs the generated active control signal to the 
actuator 2, the signal distributor 5, and the signal processor 11. As 
a result, the camera 4 is operated in the mode where the first, 
second, and third in-focus positions PI, P2, and P3 are periodically 
and cyclically taken by turns. In addition, the device 5 distributes a 
video signal to the memories 8, 9, and 10. Furthermore, the signal 
processor 11 implements the previously-mentioned signal 
processing. Thus, the signal processor 1 1 selects and decides the 
best-focus image from among three images corresponding to the 
first, second, and third in-focus positions PI, P2, and P3. The 
signal processor 1 1 transfers the video signal representative of the 
best-focus image from the related memory (the memory 8, 9, or 10) 
to the display 12. The signal processor 11 controls the display 12 
to indicate the best-focus image represented by the transferred 
video signal. As a result, an image of the trespasser of interest is 
indicated on the display 12. 

The position of the trespasser of interest may be estimated in 
response to which of the first, second, and third in-focus positions 
PI, P2, and P3 the best-focus image corresponds to. In this case, it 
is preferable that only when the trespasser of interest (the 
trespasser 3 1 in Fig. 7) enters a specified area "A" centered at the 



second in-focus position P2, the signal processor 1 1 transfers the 
video signal representative of the best-focus image from the related 
memory (the memory 8, 9, or 10) to the display 12. Thus, only 
when the trespasser of interest enters the specified area "A", an 
image thereof is indicated on the display 12. 

Second Embodiment 
Fig. 8 shows an object monitoring apparatus according to a 
second embodiment of this invention. The apparatus of Fig. 8 is 
similar to the apparatus of Fig. 2 except for design changes 
mentioned later. 

The apparatus of Fig. 8 includes a signal processor 41, a signal 
generator 42, memories 43, 44, 45, and 46, a signal processor 47, a 
display 48, a signal generator 49, and a memory 50. The signal 
processor 41 is connected to the memories 8, 9, and 10. In 
addition, the signal processor 41 is connected to the signal 
generator 42, the memories 43, 44, 45, and 46, and the signal 
generator 49. The signal generator 42 is connected to the signal 
processor 47. The signal processor 47 is connected to the 
memories 8, 9, and 10. In addition, the signal processor 47 is 
connected to the memories 43, 44, 45, 46, and 50, the display 48, 
and the signal generator 49. 

In the apparatus of Fig. 8, it is preferable that the controller 6 
is continuously active. Thus, the camera 4 continues to be operated 
in the mode where the first, second, and third in-focus positions 
PI, P2, and P3 are periodically and cyclically taken by turns. The 
signal distributor 5 loads the memory 8 with a video signal 
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corresponding to the first in-focus position PI. The signal 
distributor 5 loads the memory 9 with a video signal corresponding 
to the second in-focus position P2. The signal distributor 5 loads 
the memory 10 with a video signal corresponding to the third in- 
5 focus position P3. 

As shown in Fig. 9, a frame 51 represented by each of the 
video signals in the memories 8, 9, and 10 is divided into a plurality 
of blocks 52 each having 8 by 8 pixels. The signal generator 49 
includes a clock signal generator, and a counter responsive to the 
1 0 output signal of the clock signal generator. The counter generates a 
block address signal periodically updated. The block address signal 
designates one from among the blocks composing one frame. The 
designated block is periodically changed from one to another so that 
all the blocks composing one frame are sequentially scanned. The 

1 5 signal generator 49 outputs the block address signal to the signal 

processors 41 and 47. 

For example, the signal processor 41 is of a programmable 
type. In this case, the signal processor 41 operates in accordance 
with a program stored in its internal ROM. The program is 

2 0 designed to enable the signal processor 41 to implement processes 

mentioned later. The signal processor 41 uses the memory 43 to 
implement the processes. For example, the signal processor 47 is 
of a programmable type. In this case, the signal processor 47 
operates in accordance with a program stored in its internal ROM. 
2 5 The program is designed to enable the signal processor 47 to 
implement processes mentioned later. 



The signal processor 41 reads out a portion of the video signal 
from the memory 8 in response to the block address signal. 
Specifically, the read-out video signal portion corresponds to the 
block designated by the block address signal. The signal processor 
41 subjects the block-corresponding video signal portion to DCT 
(discrete cosine transform) according to the following equations. 

1 i J, (2x+l)n7C (2y+l)m7C 

Anm = 4CnC m £ X ^Y) cos 16 cos 16 

x=0y=0 

•■•(58) 

C n Cm = -T= forn,m = 0 -(5 9A) 

a/2 

CnCm = 1 otherwise •••( 5 9 B ) 

where f(x,y) denotes the block-corresponding video signal portion 
on a pixel-by-pixel basis. The DCT provides 64 DCT coefficients 
Anm which are arranged in a 8-by-8 matrix 54 as shown in Fig. 9. 
In the matrix 54, a DCT coefficient located at the uppermost and 
leftmost position corresponds to a DC signal component. A DCT 
coefficient at a position closer to the lowermost and rightmost 
position corresponds to a higher-frequency AC signal component. A 
variable and shiftable window region (a variable and shiftable band 
region) corresponding to a movable frequency band is set in the 
matrix. This process corresponds to operation of the signal 
generator 42. In Fig. 9, the window region (the band region) is 
illustrated as the dotted area in the matrix 54. Specifically, two 
parallel slant lines LA56 and LB57 are set in the matrix 54. DCT 
coefficients at positions on the lines LA56 and LB57, and DCT 
coefficients at positions between the lines LA56 and LB57 compose 
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the band region (the window region). The lines LA56 and LB57 are 
selected from among 14 parallel slant lines illustrated as broken 
lines in the matrix 54 in Fig. 9. To vary and shift the band region, 
the selected lines LA56 and LB57 are changed. Accordingly, the 
5 width of the band region is variable while the central position 

thereof is shiftable. The signal processor 41 receives a signal from 
the signal generator 42 which indicates a current band region. The 
signal processor 41 summates the squares of DCT coefficients at 
positions in the current band region according to the following 
1 0 equation. 

S = X(A nm ) 2 ■-(60) 
The signal processor 41 uses "SI" as an indication of the summation 
result. The signal processor 41 stores data representative of the 
summation result SI into the memory 44. 

1 5 The signal processor 41 implements signal processing for the 

video signal in the memory 9 and the video signal in the memory 10 
similar to the above-mentioned processing for the video signal in 
the memory 8. Regarding the video signal in the memory 9, the 
signal processor 41 stores data representative of a summation result 

2 0 S2 into the memory 45. Regarding the video signal in the memory 

10, the signal processor 41 stores data representative of a 
summation result S3 into the memory 46. 

The signal processor 47 reads out the data representative of 
the summation results SI, S2, and S3 from the memories 44, 45, 
2 5 and 46. The signal processor 47 calculates the variance T of the 
summation results SI, S2, and S3 according to the following 
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equation. 

1 3 

T = ^ £ (Sk - S0)2 .-.(6 1 ) 

k=l 

where SO denotes a mean of the summation results SI, S2, and S3. 
5 The signal processor 47 stores data representative of the calculated 
variance T into the memory 50. The signal processor 47 is 
informed of the current band region by the signal generator 42. 
The signal processor 47 stores data representative of the current 
band region into the memory 50. The signal processor 47 may 
1 0 store data representative of the current band region into the 
memory 43. Then, the signal processor 47 outputs a change 
requirement signal to the signal generator 42. 

The signal generator 42 updates its output signal in response 
to the change requirement signal, thereby varying or shifting the 

1 5 band region to set a new band region. The signal processors 4 1 and 

47 repeat the previously-mentioned signal processing for the new 
band region (the current band region). The signal processor 47 
calculates a new variance T(new). The signal processor 47 reads out 
the data representative of the previous variance T(old) from the 

2 0 memory 50. The signal processor 47 compares the new variance 

T(new) and the previous variance T(old) with each other. When the 
new variance T(new) is greater than the previous variance T(old), 
the signal processor 47 replaces the data of the previous variance 
T(old) in the memory 50 with data of the new variance T(new) to 
2 5 update the variance data in the memory 50. In addition, the signal 
processor 47 replaces the data of the previous band region in the 
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memoiy 50 with data of the new band region (the current band 
region) to update the band region data in the memory 50. In the 
case where the band region data are stored in the memory 43, the 
signal processor 47 replaces the data of the previous band region in 
5 the memory 43 with data of the new band region (the current band 
region) to update the band region data in the memory 43. On the 
other hand, when the new variance T(new) is equal to or smaller 
than the previous variance T(old), the signal processor 47 does not 
update the variance data in the memory 50 and the band region data 
1 0 in the memory 50 (or the memory 43). Thus, in this case, the data 
of the previous variance and the data of the previous band region 
remain in the memory 50 (or the memories 43 and 50) as they are. 
Then, the signal processor 47 outputs a change requirement signal 
to the signal generator 42. 

1 5 The signal processors 41 and 47 iterate the previously- 

mentioned signal processing while the band region (the window 
region) represented by the output signal of the signal generator 42 
is shifted and varied. When the selected band region has been 
changed among all the possible band regions, data of the greatest 

2 0 variance are present in the memory 50 and also data of the band 

region corresponding to the greatest variance are present in the 
memory 50 (or the memory 43). The signal processor 41 accesses 
the memory 50 (or the memory 43), and gets the information of the 
greatest- variance band region. The signal processor 41 obtains the 
2 5 summation results SI, S2, and S3 for the greatest- variance band 
region. The signal processor 41 stores the data of the summation 
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result SI, the data of the summation result S2, and the data of the 
summation result S3 into the memories 44, 45, and 46, 
respectively. The signal processor 47 reads out the data of the 
summation results SI, S2, and S3 from the memories 44, 45, and 
5 46. The signal processor 47 compares the summation results SI, 

52, and S3 to find the greatest of the summation results SI, S2, and 

53. When the summation result SI is the greatest, the signal 
processor 47 reads out a portion of the video signal from the 
memory 8 in response to the block address signal. When the 

1 0 summation result S2 is the greatest, the signal processor 47 reads 
out a portion of the video signal from the memory 9 in response to 
the block address signal. When the summation result S3 is the 
greatest, the signal processor 47 reads out a portion of the video 
signal from the memory 10 in response to the block address signal. 

1 5 Specifically, the read-out video signal portion corresponds to the 

block designated by the block address signal. The signal processor 
47 stores the read-out video signal portion into a memory within 
the display 48. In this way, one of the video signal portion in the 
memory 8, the video signal portion in the memory 9, and the video 

2 0 signal portion in the memory 10 which corresponds to the 

designated block and the greatest of the summation results SI, S2, 
and S3 is selected before being transferred to the memory within 
the display 48. 

Subsequently, the signal generator 49 updates the block 
2 5 address signal to change the designated block to next one. The 

signal processors 41 and 47 iterate the previously-mentioned signal 
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processing while the designated block is periodically changed from 
one to another. When all the blocks composing one frame have been 
scanned, the memory within the display 48 is loaded with a 
complete set of block-corresponding video signal portions which 
5 corresponds to one frame. The display 48 indicates an image 

represented by the complete set of the block-corresponding video 
signal portions. 

In general, DCT coefficients corresponding to higher 
frequencies are greater as the degree of focus for an object in an 
1 0 image represented by the related video signal increases. 

Accordingly, the summation result S 1 indicates the degree of focus 
for an object in a partial image represented by the related block- 
corresponding video signal portion in the memory 8. Similarly, the 
summation result S2 indicates the degree of focus for an object in a 

1 5 partial image represented by the related block-corresponding video 

signal portion in the memory 9. In addition, the summation result 
S3 indicates the degree of focus for an object in a partial image 
represented by the related block-corresponding video signal portion 
in the memory 10. The greatest of the summation results SI, S2, 

2 0 and S3 corresponds to the best focus. Accordingly, for each of the 

blocks composing one frame, the best-focus video signal portion is 
selected from among the block-corresponding signal portions in the 
memories 8, 9, and 10, and is then transferred to the memory 
within the display 48. As a result, the best-focus image is indicated 
2 5 by the display 48. In the DCT-coefficient matrix, the band region at 
which the variance T peaks is suited for accurate evaluation of the 
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degrees of focus on the basis of the summation results SI, S2, and 
S3. 

Third Embodiment 
Fig. 10 shows an object monitoring apparatus according to a 
5 third embodiment of this invention. The apparatus of Fig. 10 is 
similar to the apparatus of Fig. 8 except for design changes 
mentioned later. 

The apparatus of Fig. 10 includes a signal generator 42A 
instead of the signal generator 42 (see Fig. 8) . The apparatus of Fig. 
10 10 includes an input device 61 and a memory 62 connected to the 
signal generator 42A. 

The memory 62 stores data representing a plurality of 
different patterns of a window region (a band region). The input 
device 61 can be operated by a user. The input device 61 outputs a 

1 5 pattern selection signal to the signal generator 42A when being 

operated by the user. The signal generator 42A reads out a data 
piece from the memory 62 which represents a pattern designated 
by the pattern selection signal. Thus, the signal generator 42A 
selects one from among the different patterns in accordance with 

2 0 the pattern selection signal. The signal generator 42A sets a 

current window region (a current band region) of the selected 
pattern. The signal generator 42A produces a signal representative 
of the current window region (the current band region). The signal 
generator 42A outputs the window region signal to the signal 
25 processors 41 and 47. The signal generator 42A updates its output 
signal in response to a change requirement signal fed from the 
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signal processor 47, thereby shifting or varying the window region 
to set a new window region. 

Fig. 1 1 shows a first example of the pattern of the window 
region (the band region) 71 given by the signal generator 42A. The 
pattern in Fig. 1 1 conforms with a vertical line or a column in a 
DCT-coefficient matrix. In this case, the window region 71 is 
shifted from the leftmost column to the rightmost column during 
the scanning of the DCT-coefficient matrix. The pattern in Fig. 1 1 
is suited for objects optically changeable to a great degree in a 
vertical direction, for example, objects having horizontal stripes. 
When the user is interested in such objects, the user operates the 
input device 61 to select the pattern in Fig. 11. 

Fig. 12 shows a second example of the pattern of the window 
region (the band region) 72 given by the signal generator 42A. The 
pattern in Fig. 12 conforms with a horizontal line or a row in a DCT- 
coefficient matrix. In this case, the window region 72 is shifted 
from the uppermost row to the lowermost row during the scanning 
of the DCT-coefficient matrix. The pattern in Fig. 12 is suited for 
objects optically changeable to a great degree in a horizontal 
direction, for example, objects having vertical stripes. When the 
user is interested in such objects, the user operates the input 
device 61 to select the pattern in Fig. 12. 

Fourth Embodiment 
Fig. 13 shows an object monitoring apparatus according to a 
fourth embodiment of this invention. The apparatus of Fig. 13 is 
similar to the apparatus of Fig. 2 except for design changes 
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mentioned later. 

The apparatus of Fig. 13 includes a camera 4A instead of the 
camera 4 (see Fig. 2). The camera 4A has condensers 81, 
expansion- contraction members 82, and a CCD-based photoelectric 
5 conversion device 83. The condensers 81 are arranged in a lattice 
or a matrix. The condensers 81 are connected by the expansion- 
contraction members 82. The apparatus of Fig. 13 includes a driver 
84 for the expansion-contraction members 82. The driver 84 is 
connected to the controller 7. 

1 0 With reference to Fig. 14, a lens 91 is separate from an object 

92. The lens 91 has a focal point F99 extending on the optical axis 
100 thereof. Three different projection planes 93, 94, and 95 are 
considered which extend rearward of the focal point F99, and 
which are arranged in that order. First light coming from the object 
15 92 and being parallel with the optical axis 100 travels through the 
lens 91 before passing through the focal point F99. Second light 
coming from object 92 toward the center of the lens 91 travels 
straight. An image of the object 92 is formed at a position where 
the first light and the second light intersect. In Fig. 14, the 

2 0 intersection position coincides with the projection plane 94. 

Accordingly, at the projection plane 94, an image 97 of the object 
92 is in focus. On the other hand, at the projection planes 93 and 
95 extending frontward and rearward of the projection plane 94, 
images 96 and 98 of the object 92 are fuz2y while being centered at 
2 5 the intersections between the projection planes 93 and 95 and the 
straight line passing through the object 92 and the center of the 
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lens 91. At such a projection plane closer to the focal point F99, a 
smaller image of the object 92 is formed thereon. Therefore, the 
image size varies in accordance with which of the projection planes 
93, 94, and 95 is taken. Thus, the image size varies in accordance 
5 with which of the first, second, and third in-focus positions PI, P2, 
and P3 is taken. The apparatus of Fig. 13 features a structure 
compensating for the image size variation. The compensation 
structure will be described below. 

As shown in Fig. 15, the condenser 81 includes a lens 101, a 
1 0 prism 102, and an optical finer cable 103. Light condensed by the 
lens 101 is transmitted to the optical fiber cable 103 by the prism 
102. The light is propagated along the optical fiber cable 103 
before reaching a corresponding segment, for example, a 
corresponding pixel segment, of the photoelectric conversion 

1 5 device 83 (see Fig. 13). 

As shown in Fig. 16, the expansion-contraction member 82 
includes a spring 111, a shape memory alloy member 112, a heater 
113, and connectors 114A and 114B. The connectors 114A and 
114B are coupled with adjacent condensers 81 respectively. The 

2 0 spring 111 and the shape memory alloy member 112 are provided 

between the connectors 114A and 114B. The heater 113 is 
associated with the shape memory alloy member 112. The heater 
113 is electrically connected to the driver 84 (see Fig. 13). The 
spring 111 urges the connectors 1 14A and 1 14B in the direction 
2 5 toward each other. The shape memory alloy member 112 forces the 
connectors 1 14A and 1 14B in the direction away from each other 
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when being heated by the heater 113. 

With reference back to Fig. 13, the driver 84 receives an 
active control signal from the controller 7. The driver 84 
recognizes which of the first, second, and third in-focus positions 
5 PI, P2, and P3 is currently taken by referring to the active control 
signal. The driver 84 controls the heaters 113 within the 
expansion-contraction members 82 in response to the active 
control signal fed from the controller 7, that is, in response to 
which of the first, second, and third in-focus positions PI, P2, and 
10 P3 is currently taken. As the heaters 113 within the expansion- 
contraction members 82 are activated or deactivated by the driver 
84, the distances between the condensers 81 change and hence the 
effective size of an image formed on the photoelectric conversion 
device 83 varies. The control of the heaters 1 13 by the driver 84 is 

1 5 designed to compensate for the previously-mentioned image-size 

variation which would be caused by change among the first, second, 
and third in-focus positions PI, P2, and P3. 

In the apparatus of Fig. 13, it is unnecessary for the signal 
distributor 5 to execute compensation for the image-size variation. 

2 0 Fifth Embodiment 

Fig. 17 shows a portion of an object monitoring apparatus 
according to a fifth embodiment of this invention. The apparatus of 
Fig. 17 is similar to the apparatus of Fig. 2 or Fig. 8 except for 
design changes mentioned later. 
2 5 The apparatus of Fig. 17 includes a camera 4B instead of the 

camera 4 (see Fig. 2 or Fig. 8). The camera 4B has a combination 



-35- 



lens 121, partition walls 122A and 122B, condensers 123A, 123B, 
and 123C, and a photoelectric conversion device 3B. 

The combination lens 121 is composed of segments 12 1A, 
12 IB, and 12 1C. As shown in Fig. 18, there are original lenses 131, 
5 132, and 133 having different focal lengths respectively. Central 
portions of the original lenses 131, 132, and 133 are cut out. The 
central portions of the original lenses 131, 132, and 133 are 
combined into the combination lens 121. Specifically, the segments 
12 1A, 12 IB, and 12C of the combination lens 121 are formed by the 
1 0 central portions of the original lenses 131, 132, and 133 
respectively. 

The partition walls 122A and 122B separate optical paths 
from each other which extend between the combination lens 121 
and the condensers 123A, 123B, and 123C. The condensers 123A, 

1 5 123B, and 123C are optically coupled with first, second, and third 

segments of the photoelectric conversion device 3B, respectively. 
The first, second, and third segments of the photoelectric 
conversion device 3B are connected to the first, second, and third 
memories 8, 9, and 10 respectively. 

2 0 Light passing through the segment 12 1A of the combination 

lens 121 enters the condenser 123A, and then reaches the first 
segment of the photoelectric conversion device 3B and forms an 
image thereon. The first segment of the photoelectric conversion 
device 3B converts the image into a corresponding video signal, 
2 5 which is stored into the memory 8. 

Light passing through the segment 12 IB of the combination 
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lens 121 enters the condenser 123B, and then reaches the second 
segment of the photoelectric conversion device 3B and forms an 
image thereon. The second segment of the photoelectric 
conversion device 3B converts the image into a corresponding video 
5 signal, which is stored into the memory 9. 

Light passing through the segment 12 1C of the combination 
lens 121 enters the condenser 123C, and then reaches the third 
segment of the photoelectric conversion device 3B and forms an 
image thereon. The third segment of the photoelectric conversion 
1 0 device 3B converts the image into a corresponding video signal, 
which is stored into the memory 10. 

It is unnecessary for the apparatus of Fig. 17 to periodically 
and cyclically change the lens position. 

Sixth Embodiment 

1 5 Fig. 19 shows a portion of an object monitoring apparatus 

according to a sixth embodiment of this invention. The apparatus of 
Fig. 19 is similar to the apparatus of Fig. 17 except for design 
changes mentioned later. 

The apparatus of Fig. 19 includes a light receiving unit 143 in 

2 0 which the combination lens 121 is provided. The light receiving 

unit 143 also contains the partition walls 122A and 122B, and the 
condensers 123A, 123B, and 123C (see Fig. 17). An optical fiber 
cable 141 connects the light receiving unit 143 and a detection unit 
142. Specifically, the output ends of the condensers in the light 
2 5 receiving unit 143 are optically coupled with inlet ends of the 

optical fiber cable 141. Outlet ends of the optical fiber cable 141 



are optically coupled with the photoelectric conversion device 3B 
which is provided on the detection unit 142. The first, second, and 
third segments of the photoelectric conversion device 3B are 
connected to the first, second, and third memories 8, 9, and 10 
provided in the detection unit 142, respectively. 

Since the detection unit 142 and the light receiving unit 143 
are connected by the optical fiber cable 141, it is possible to locate 
the units 142 and 143 at positions remarkably distant from each 
other. 

Seventh Embodiment 

Fig. 20 shows an object monitoring apparatus according to a 
seventh embodiment of this invention. The apparatus of Fig. 20 is 
similar to the apparatus of Fig. 19 except for design changes 
mentioned later. 

The apparatus of Fig. 20 includes a plurality of optical fiber 
cables 141(1), 141(2), and 141 (N), a plurality of detection 
units 142(1), 142(2), •-, and 142(N), and a plurality of light 
receiving units 143(1), 143(2), and 143(N), where "N" denotes 
a predetermined natural number, for example, 8. The detection 
units 142(1), 142(2), and 142(N) are connected to the light 
receiving units 143(1), 143(2), and 143(N) by the optical fiber 
cables 141(1), 141(2), and 141(N), respectively. 

Video signals outputted from the detection units 142(1), 
142(2), and 142(N) are combined into a multiple-image video 
signal by an image combining device 151. The multiple-image video 
signal is indicated by a multiple -image display 152. 
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As shown in Fig. 21, the light receiving units 143(1), 143(2), 
and 143(N) are mounted on a vehicle 180 so as to monitor the 
surroundings of the vehicle 180, that is, the surroundings of a 
rectangle defined by the body of the vehicle 180. The image 
5 combining device 151 and the multiple -image display 152 are 
placed in the vehicle 180. 

Eighth Embodiment 
Fig. 22 shows an object monitoring apparatus according to an 
eighth embodiment of this invention. The apparatus of Fig. 22 
1 0 includes a movable lens 201, an electrically-powered actuator 202, 
and a photoelectric conversion device 203 provided in a camera or 
an image capturing device 204. The lens 201 is located in front of 
the photoelectric conversion device 203. The actuator 202 
operates to move the lens 201 relative to the photoelectric 

1 5 conversion device 203. Light passes through the lens 201 before 

reaching the photoelectric conversion device 203 and forming 
thereon an image of a scene extending in front of the camera 204. 
The photoelectric conversion device 203 converts the image into a 
corresponding video signal. The photoelectric conversion device 

2 0 203 outputs the video signal. The photoelectric conversion device 

203 implements periodical scanning so that the video signal 
represents a sequence of frames. The photoelectric conversion 
device 203 is of, for example, a CCD-based type. 

The apparatus of Fig. 22 further includes a signal processor 
2 5 210, a display 212, and an operation unit 214. The signal processor 
210 includes a combination of an input/ output port 210A, a 
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processing section 21 OB, a ROM 2 IOC, and a RAM 210D. The signal 
processor 210 operates in accordance with a program stored in the 
ROM 2 IOC. 

The input/ output port 21 OA within the signal processor 210 
5 is connected to the photoelectric conversion device 203. The 
input/ output port 21 OA receives the video signal from the 
photoelectric conversion device 203. As will be made clear later, 
the device 210 processes the received video signal. 

The input/output port 21 OA within the signal processor 210 
10 is connected to the actuator 202. The input/output port 21 OA 
outputs a drive signal to the actuator 202. As will be made clear 
later, the signal processor 210 controls the actuator 202. 

The input/output port 21 OA within the signal processor 210 
is connected to the display 212. As will be made clear later, the 

1 5 input/output port 21 OA outputs a processing-resultant video signal 

to the display 212. The processing-resultant video signal is 
visualized by the display 212. The signal controller 210 can control 
the display 212. 

The input/output port 210A within the signal processor 210 

2 0 is connected to the operation unit 214. The operation unit 214 can 

be actuated by a user. The operation unit 214 outputs a turn-on 
signal or a turn-off signal to the input/output port 210A when being 
actuated by the user. 

The actuator 202 can change the position of the lens 201 
2 5 relative to the photoelectric conversion device 203 among three 

different positions. Thus, the actuator 202 can change the distance 



-40- 



between the lens 201 and the photoelectric conversion device 203 
among three different values. According to the distance change, the 
plane on which the camera 204 is focused is changed among three 
separate positions (first, second, and third in-focus positions) PI, 
5 P2, and P3. The first, second, and third in-focus positions PI, P2, 
and P3 are equal to the farthest, intermediate, and nearest positions 
as seen from the camera 204, respectively. 

As previously mentioned, the signal processor 210 operates in 
accordance with a program. Fig. 23 is a flowchart of a segment of 

1 0 the program which is started in response to a turn-on signal fed 

from the operation unit 214. 

As shown in Fig. 23, a first step 301 of the program segment 
controls the actuator 202 so that the second in-focus position P2 
will be taken. 

15 A step 302 following the step 301 processes the video signal 

fed from the photoelectric conversion device 203. Specifically, the 
step 302 subjects the video signal to a motion detection process. 
For example, the motion detection process is based on a 
comparison between two successive frames represented by the 

2 0 video signal. 

A step 303 subsequent to the step 302 decides whether a 
moving object is present in or absent from an image represented by 
the video signal. When a moving object is absent, the program 
jumps from the step 303 to a step 314. Otherwise, the program 
2 5 advances from the step 303 to a step 304. 

The step 304 stores a 1 -frame-corresponding segment of the 
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video signal into a second area within the RAM 210D. 

A step 305 following the step 304 controls the actuator 202 
so that the third in-focus position P3 will be taken. 

A step 306 subsequent to the step 305 subjects a 1-frame- 
5 corresponding segment of the video signal to image-size correction 
to generate a correction-resultant video signal. The image-size 
correction is designed to provide an image size equal to that 
corresponding to the second in-focus position P2. 

A step 307 following the step 306 stores a 1-frame- 
1 0 corresponding segment of the correction-resultant video signal into 
a third area within the RAM 210D. 

A step 308 subsequent to the step 306 controls the actuator 
202 so that the first in-focus position PI will be taken. 

A step 309 following the step 308 subjects a 1-frame- 

1 5 corresponding segment of the video signal to image-size correction 

to generate a correction-resultant video signal. The image-size 
correction is designed to provide an image size equal to that 
corresponding to the second in-focus position P2. 

A step 310 subsequent to the step 309 stores a 1-frame- 

2 0 corresponding segment of the correction-resultant video signal into 

a first area within the RAM 210D. 

A step 311 following the step 310 reads out the video signals 
from the first, second, and third areas within the RAM 210D to get 
images represented thereby. The step 311 calculates the degrees of 
2 5 focus for the moving object regarding the respective images. The 
calculation of the focus degrees may use a technique in the second 
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embodiment of this invention which is based on the execution of 
DCT and the summations of DCT coefficients. 

A step 312 subsequent to the step 311 compares the 
calculated focus degrees with each other, and decides which of the 
5 images corresponds to the best focus in response to the comparison 
results. 

A step 313 following the step 312 outputs the video signal 
representative of the best-focus image to the display 212. The step 
313 controls the display 212 to indicate the best-focus image 
1 0 represented by the video signal. After the step 313, the program 
advances to the step 314. 

The step 3 14 decides whether or not a turn-off signal is fed 
from the operation unit 214. When a turn-off signal is fed from the 
operation unit 214, the program exits from the step 314 and then 

1 5 the current execution cycle of the program segment ends. 

Otherwise, the program returns from the step 314 to the step 301. 
Ninth Embodiment 
Fig. 24 shows an object monitoring apparatus according to a 
ninth embodiment of this invention. The apparatus of Fig. 24 

2 0 includes a movable lens 401, an electrically-powered actuator 402, 

and a photoelectric conversion device 403 provided in a camera or 
an image capturing device 404. The lens 401 is located in front of 
the photoelectric conversion device 403. The actuator 402 
operates to move the lens 401 relative to the photoelectric 
2 5 conversion device 403. Light passes through the lens 401 before 
reaching the photoelectric conversion device 403 and forming 
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thereon an image of a scene extending in front of the camera 404. 
The photoelectric conversion device 403 converts the image into a 
corresponding video signal. The photoelectric conversion device 
403 outputs the video signal. The photoelectric conversion device 
5 implements periodical scanning so that the video signal represents 
a sequence of frames. The photoelectric conversion device 403 is 
of, for example, a CCD-based type. 

The apparatus of Fig. 24 further includes a signal processor 
410, a display 412, and an operation unit 414. The signal processor 

1 0 410 includes a combination of an input/ output port 410A, a 

processing section 41 0B, a ROM 4 10C, and a RAM 410D. The signal 
processor 410 operates in accordance with a program stored in the 
ROM 410C. 

The input/output port 410A within the signal processor 410 
15 is connected to the photoelectric conversion device 403. The 
input/ output port 41 OA receives the video signal from the 
photoelectric conversion device 403. As will be made clear later, 
the device 410 processes the received video signal. 

The input/output port 410A within the signal processor 410 

2 0 is connected to the actuator 402. The input/output port 410A 

outputs a drive signal to the actuator 402. As will be made clear 
later, the signal processor 410 controls the actuator 402. 

The input/ output port 41 OA within the signal processor 410 
is connected to the display 412. As will be made clear later, the 
2 5 input/output port 41 OA outputs a processing-resultant video signal 
to the display 412. The processing-resultant video signal is 
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visualized by the display 412. The signal processor 410 can control 
the display 412. 

The input/ output port 41 OA within the signal processor 410 
is connected to the operation unit 414. The operation unit 414 can 
5 be actuated by a user. The operation unit 414 outputs a turn-on 

signal or a turn-off signal to the input/ output port 41 OA when being 
actuated by the user. 

The actuator 402 can change the position of the lens 40 1 
relative to the photoelectric conversion device 403 among three 
1 0 different positions. Thus, the actuator 402 can change the distance 
between the lens 401 and the photoelectric conversion device 403 
among three different values. According to the distance change, the 
plane on which the camera 404 is focused is changed among three 
separate positions (first, second, and third in-focus positions) PI, 

1 5 P2, and P3. The first, second, and third in-focus positions PI, P2, 

and P3 are equal to the farthest, intermediate, and nearest positions 
as seen from the camera 404, respectively. 

As previously mentioned, the signal processor 410 operates in 
accordance with a program. Fig. 25 is a flowchart of a segment of 

2 0 the program which is started in response to a turn-on signal fed 

from the operation unit 414. 

As shown in Fig. 25, a first step 501 of the program segment 
controls the actuator 402 so that the first in-focus position PI will 
be taken. 

2 5 A step 502 following the step 501 subjects a 1 -frame- 

corresponding segment of the video signal to image-size correction 
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to generate a correction-resultant video signal. The image-size 
correction is designed to provide an image size equal to that 
corresponding to the second in-focus position P2. 

A step 503 subsequent to the step 502 stores a 1-frame- 
5 corresponding segment of the correction-resultant video signal into 
a first area within the RAM 410D. 

A step 504 following the step 503 controls the actuator 402 
so that the second in-focus position PI will be taken. 

A step 505 subsequent to the step 504 stores a 1-frame- 
1 0 corresponding segment of the video signal into a second area within 
the RAM 410D. 

A step 506 following the step 505 controls the actuator 402 
so that the third in-focus position PI will be taken. 

A step 507 subsequent to the step 506 subjects a 1-frame- 

1 5 corresponding segment of the video signal to image-size correction 

to generate a correction-resultant video signal. The image-size 
correction is designed to provide an image size equal to that 
corresponding to the second in-focus position P2. 

A step 508 following the step 507 stores a 1-frame- 

2 0 corresponding segment of the correction-resultant video signal into 

a third area within the RAM 410D. 

A signal processing block 509 follows the step 508. After the 
block 509, the program advances to a step 510. 

The step 510 decides whether or not a turn-off signal is fed 
2 5 from the operation unit 414. When a turn-off signal is fed from the 
operation unit 414, the program exits from the step 510 and then 



-46- 



the current execution cycle of the program segment ends. 
Otherwise, the program returns from the step 510 to the step 501. 

As shown in Fig. 26, the signal processing block 509 has a 
first step 601 which follows the step 508 (see Fig. 25). The step 
5 601 initializes values J, K, and L to "1". In addition, the step 601 
initializes a value Tmax to "0". The value Tmax denotes a maximal 
variance. The value J designates one from among blocks composing 
one frame. Specifically, different values J (1, 2, 3, and JO) are 
assigned to blocks composing one frame, respectively. Accordingly, 

1 0 one of the values J designates one of the blocks. The value K 

designates one from among the first, second, and third areas within 
the RAM 410D or one from among the video signals in the first, 
second, and third areas within the RAM 410D. Specifically, the 
value K being "1" is assigned to the first area within the RAM 410D 
15 or the video signal in the first area within the RAM 410D. The value 
K being "2" is assigned to the second area within the RAM 410D or 
the video signal in the second area within the RAM 410D. The value 
K being "3" is assigned to the third area within the RAM 410D or 
the video signal in the third area within the RAM 410D. The value L 

2 0 designates one from among different window regions in a DCT- 

coefficient matrix. The window regions are different in position and 
size. The window regions correspond to different frequency bands, 
respectively. Specifically, different values L (1, 2, 3, and LO) 
are assigned to the widow regions, respectively. Accordingly, one of 
2 5 the values L designates one of the window regions as a selected 

window. Here, LO denotes a value equal to the total number of the 
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window regions. After the step 601, the program advances to a step 
602. 

The step 602 reads out a portion of the video signal from one 
of the first, second, and third areas within the RAM 410D which is 
5 designated by the value K. Specifically, the first area within the 
RAM 410D is designated when the value K is "1". The second area 
within the RAM 410D is designated when the value K is "2". The 
third area within the RAM 410D is designated when the value K is 
"3". The read-out video signal portion corresponds to the block 

1 0 designated by the value J. 

A step 603 following the step 602 subjects the block- 
corresponding video signal portion to DCT (discrete cosine 
transform) according to the previously-indicated equations (58), 
(59A), and (59B). 

15 A step 604 subsequent to the step 603 sets a band region (a 

window region) in the DCT- coefficient matrix which is designated 
by the value L. The step 604 summates the squares of DCT 
coefficients in the band region according to the previously-indicated 
equation (60). Thus, the step 604 gets the summation result S(K). 

2 0 A step 605 following the step 604 increments the value K by 

"1". A step 606 subsequent to the step 605 decides whether or not 
the value K exceeds "3". Wen the value K exceeds "3", the program 
advances from the step 606 to a step 607. Otherwise, the program 
returns from the step 606 to the step 602. 
2 5 As a result, the summation results S(l), S(2), and S(3) are 

generated by the step 604. 
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The step 607 calculates the variance T(L) of the summation 
results S(l), S(2), and S(3) according to the previously-indicated 
equation (61). 

A step 608 following the step 607 compares the calculated 
5 variance TfL) with a maximal variance Tmax. It should be noted that 
the initial value of the maximal variance Tmax is "0". When the 
calculated variance T(L) is greater than the maximal variance Tmax, 
the program advances from the step 608 to a step 609. Otherwise, 
the program jumps from the step 608 to a step 610. 
1 0 The step 609 updates the maximal variance Tmax. The step 

609 also updates a number Lmax corresponding to the maximum 
variance Tmax. The number Lmax indicates the greatest- variance 
band region. Specifically, the step 609 equalizes the maximal 
variance Tmax to the calculated variance TfL). The step 609 

1 5 equalizes the number Lmax to the value L. After the step 609, the 

program advances to the step 610. 

The step 610 increments the value L by "1". A step 611 
following the step 610 resets the value K to "1". A step 612 
subsequent to the step 611 decides whether or not the value L 

2 0 exceeds a predetermined number L0. Wen the value L exceeds the 

predetermined number L0, the program advances from the step 
612 to a step 613. Otherwise, the program returns from the step 
612 to the step 602. 

The step 613 gets information of the greatest-variance band 
2 5 region from the value Lmax. The step 613 retrieves the summation 
results S(l), S(2), and S(3) for the greatest-variance band region. 
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A step 614 following the step 613 compares the retrieved 
summation results S(l), S(2), and S(3), and hence finds the greatest 
of the summation results S(l), S(2), and S(3). 

A step 615 subsequent to the step 614 reads out a portion of 
5 the vide signal from one of the first, second, and third areas within 
the RAM 410D which corresponds to the greatest summation result. 
Specifically, the step 615 reads out a portion of the video signal 
from the first area within the RAM 410D when the summation 
result S(l) is the greatest. The step 615 reads out a portion of the 
1 0 video signal from the second area within the RAM 410D when the 
summation result S(2) is the greatest. The step 615 reads out a 
portion of the video signal from the third area within the RAM 410D 
when the summation result S(3) is the greatest. The read-out 
portion of the video signal corresponds to the block designated by 

1 5 the block number J. The step 615 outputs the read-out video signal 

portion to the display 412, and stores it into a memory within the 
display 412. 

A step 616 increments the value J by "1". A step 617 
following the step 616 resets the value L to "1". A step 618 

2 0 subsequent to the step 617 decides whether or not the value J 

exceeds a predetermined number JO. Wen the value J exceeds the 
predetermined number JO, the program advances from the step 
618 to the step 510 (see Fig. 25). Otherwise, the program returns 
from the step 618 to the step 602. 
2 5 As understood from the previous description, one of the video 

signal portion in the first area within the RAM 410D, the video 
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signal portion in the second area within the RAM 410D, and the 
video signal portion in the third area within the RAM 410D which 
corresponds to the designated block and the greatest of the 
summation results S(l), S(2), and S(3) is selected before being 
5 transferred to the memory within the display 412. 

Subsequently, the designated block is changed to next one. 
The previously-mentioned signal processing is iterated while the 
designated block is periodically changed from one to another. When 
all the blocks composing one frame have been scanned, the memory 
1 0 within the display 412 is loaded with a complete set of block- 
corresponding video signal portions which corresponds to one 
frame. The display 412 indicates an image represented by the 
complete set of the block-corresponding video signal portions. 
In general, DCT coefficients corresponding to higher 

1 5 frequencies are greater as the degree of focus for an object in an 

image represented by the related video signal increases. 
Accordingly, the summation result S(l) indicates the degree of 
focus for an object in a partial image represented by the related 
block- corresponding video signal portion in the first area within the 

2 0 RAM 410D. Similarly, the summation result S(2) indicates the 

degree of focus for an object in a partial image represented by the 
related block-corresponding video signal portion in the second area 
within the RAM 410D. In addition, the summation result S(3) 
indicates the degree of focus for an object in a partial image 
2 5 represented by the related block-corresponding video signal portion 
in the third area within the RAM 410D. The greatest of the 
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summation results S(l), S(2), and S(3) corresponds to the best 
focus. Accordingly, for each of the blocks composing one frame, the 
best-focus video signal portion is selected from among the block- 
corresponding signal portions in the first, second, and third areas 
5 within the RAM 410D, and is then transferred to the memory 
within the display 412. As a result, the best-focus image is 
indicated by the display 412. In the DCT-coefficient matrix, the 
band region at which the variance T peaks is suited for accurate 
evaluation of the degrees of focus on the basis of the summation 
1 0 results S(l), S(2), and S(3). 



